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Abstract — We present a novel modulation level classifica- 
tion (MLC) method based on probability distribution distance 
functions. The proposed method uses modified Kuiper and 
Kolmogorov-Smirnov distances to achieve low computational 
complexity and outperforms the state of the art methods based 
on cumulants and goodness-of-flt tests. We derive the theoretical 
performance of the proposed MLC method and verify it via 
simulations. The best classification accuracy, under AWGN with 
SNR mismatch and phase jitter, is achieved with the proposed 
MLC method using Kuiper distances. 

I. Introduction 

Modulation level classification (MLC) is a process which 
detects the transmitter's digital modulation level from a re- 
ceived signal, using a priori knowledge of the modulation 
class and signal characteristics needed for downconversion and 
sampling. Among many modulation classification methods |]J, 
a cumulant (Cm) based classification |2| is one of the most 
widespread for its ability to identify both the modulation class 
and level. However, differentiating among cumulants of the 
same modulation class, but with different levels, i.e. 16QAM 
vs. 64QAM, requires a large number of samples. A recently 
proposed method |3| based on a goodness-of-fit (GoF) test 
using Kolmogorov-Smirnov (KS) statistic has been suggested 
as an alternative to the Cm-based level classification which 
require lower number of samples to achieve accurate MLC. 

In this letter, we propose a novel MLC method based on 
distribution distance functions, namely Kuiper (K) Q |j5] Sec. 
3.1] and KS distances, which is a significant simplification 
of methods based on GoF. We show that using a classifier 
based only on K-distance achieves better classification than 
the KS-based GoF classifier. At the same time, our method 
requires only 2AIL additions in contrast to 2A/(log2Af +2K) 
additions for the KS-based GoF test, where K is the number of 
distinct modulation levels, M is the sample size and L <C M 
is the number of test points used by our method. 

II. Proposed MLC Method 

A. System Model 

Following we assume a sequence of AI discrete, 
complex, i.i.d. and sampled baseband symbols, s^^*^^ = 
[s^*^"* • • • sjj^''], drawn from a modulation order Mk G 
{Ml, ■ ■ ■ , Mk}, transmitted over AWGN channel, perturbed 
by uniformly distributed phase jitter and attenuated by an 
unknown factor A > 0. Therefore, the received signal is given 
as r = [ri • • -rji/], where r„ = Ae^*"s„ + g„, {.g„}*ii 
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CAA(0,cr2) and {4>„}*Li ^ U {-(t), The task of the 
modulation classifier is to find A4k, from which s'^'') was 
drawn, given r. Without loss of generality, we consider unit 
power constellations and define SNR as 7 = A^/a^. 

B. Classification based on Distribution Distance Function 

The proposed method modifies MLC technique based on 
GoF testing using the KS statistic |3|. Since the KS statistic, 
which computes the minimum distance between theoretical 
and empirical cumulative distribution function (ECDF), re- 
quires all CDF points, we postulate that similarly accurate 
classification can be obtained by evaluating this distance using 
a smaller set of points in the CDF. 

Let z = [zi ■ ■ ■ zn] = /(r) where /(•) is the chosen feature 
map and N is the number extracted features. Possible feature 
maps include |r| (magnitude, N = M) or the concatenation of 
5R{r} and 5{r} (quadrature, N = 2M). The theoretical CDF 
of z given Aik and 7, Fq{z), is assumed to be known a priori 
(methods of obtaining these distributions, both empirically and 
theoretically, are presented in |[3] Sec. III-A]). The K CDFs, 
one for each modulation level, define a set of test points 

ig) =argmaxi?y(z), (1) 
with the distribution distances given by 

D',f{z) = {-ir[F!,{z)~F^{z)), (2) 

for I < i, j < K, i ^ j, and e E {0, 1}, corresponding to the 
maximum positive and negative deviations, respectively. Note 
the symmetry in the test points such that t^'f^ — t\^\ Thus, 
there are L = 2(^) test points for a K order classification. 
The ECDF, given as 

1 ^ 

FN{t)^-J2K^n<t), (3) 
n=l 

is evaluated at the test points to form F^r = {^-/v(4j^)}' 
^ ^ hj ^ K,i ^ j. Here, I(-) equals to one if the input is 
true, and zero otherwise. By evaluating F/v (t) only at the test 
points in ([T]i, we get 

- i-iy (Fn (i^) - Fi {t<ff)) (4) 

which are then used to find an estimate of the maximum 
positive and negative deviations 

^?^ = , max i<J<K, (5) 

of the ECDF to the true CDFs. The operation of finding the 
ECDF at the given testpoints (|4|i can be implemented using 



2 



a simple thresholding and counting operation and does not 
require samples to be sorted as in [3|. The metrics in ^ are 
used to find the final distribution distance metrics 
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Number of Operations and Memory Usage 
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which are the reduced complexity versions of the KS distance 
(rcKS) and the K distance (rcK), respectiveljQ Finally, we 
use the metrics in ^ as substitutes to the true distance-based 
classifiers with the following rule: choose such that 



are; min D,-, ky 



arff min V.i . 
i<j<K ■' 
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In the remainder of the letter, we define /i^(Fjv) = fc_D and 
hy{FM) = ky, where ko, ky & {1, . . . , K}. 

C. Analysis of Classification Accuracy 

Let t = [ti • • • ii] denote the set of test points, {^j^}' sorted 
in ascending order. For notational consistency, we also define 
the following points, to = — oo and fL+i — +oo. Given that 
these points are distinct, they partition z into L + 1 regions. 
An individual sample, z„, can be in region I, such that < 
Zn < ti, with a given probability, determined by Fq{z). 

Assuming z„ are independent of each other, we can con- 
clude that given z, the number of samples that fall into each 
of the L + 1 regions, n = [ni • • • n^+i], is jointly distributed 
according to a multinomial PMF given as 



/(n|iV,p) 



Arii,"i...„"i + 1 L + 1 

, if J:n^^N, 



,0, 



r=l (8) 

Otherwise, 



where p = [pi • • -pi+i], and pi is the probability of an 
individual sample being in region I. Given that z is drawn 
from Mk, Pi = F>;{ti) - F^iU^i), for < / < i + 1. 
Now, with particular n, the ECDF at all the test points is 

1 ' 

Fjv(n) ^ [Fjv(ti)---Fw(ti)], Fjv(<0 = ^I]".- (9) 

1=1 

Therefore, we can analytically find the probability of classifi- 
cation to each of the K classes as 

Pr(fc = K\Mk) = J2 I(V(Fiv(n)) - ^)/(n|7V,p), (10) 

for the rcK classifier. A similar expression can be applied to 
rcKS, replacing hy{-) with hj^{-) in (lOl. 



D. Complexity Analysis 

Given that the theoretical CDFs change with SNR, we store 
distinct CDFs for W SNR values for each modulation level 
(impact of the selection of W on the accuracy is discussed 
further in Section in-B| ) Further, we store KW theoretical 
CDFs of length N each. For the non-reduced complexity 
classifiers that require sorting samples, we use a sorting 
algorithm whose complexity is N log N. From Table |l] we 

'Note, that other non-parametric distances used in hypothesis testing exist 
(see introduction in e.g. |4|), although for brevity they are not addressed here. 
We note, however, that our approach is easily applied to any assumed distance 
metric. 
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see that for ii' < 3 rcK/rcKS tests use less addition operations 
than K/KS-based methods |3J and Cm-based classification |2J. 
For K > 3, the rcK method is more computationally efficient 
when implemented in ASIC/FPGA, and is comparable to Cm 
in complexity when implemented on a CPU. In addition, the 
processing time would be shorter for an ASIC/FPGA imple- 
mentation, which is an important requirement for cognitive 
radio applications. Furthermore, their memory requirements 
are also smaller since N has to be large for a smooth 
CDF. It is worth mentioning that the authors in f3] used 
the theoretical CDF, but used N as the number of samples 
to generate the CDF in their complexity figures. The same 
observation favoring the proposed rcK/rcKS methods holds 
for the magnitude-based (mag) classifiers ||3| Sec III-A]. 

III. Results 

As an example, we assume that the classification task is 
to distinguish between M-QAM, where M e {4, 16, 64}. 
For comparison we also present classification result based on 
maximum likelihood estimation (ML). 

A. Detection Performance versus SNR 

In the first set of experiments we evaluate the performance 
of the proposed classification method for different values of 
SNR. The results are presented in Fig. [T] We assume fixed 
sample size of M = 50, in contrast to [3, Fig. 1] to evaluate 
classification accuracy for a smaller sample size. We confirm 
that even for small sample size, as shown in fT, Fig. 1], Cm has 
unsatisfying classification accuracy at high SNR. In (10,17) dB 
region rcK clearly outperforms all detection techniques, while 
as SNR exceeds «17dB all classification methods (except 
Cm) converge to one. In low SNR region, (0,10) dB, KS, 
rcKS, rcK perform equally well, with Cm having comparable 
performance. The same observation holds for larger sample 
sizes, not shown here due to space constraints. Note that the 



analytical performance metric developed in Section II-C for 
rcK and rcKS matches perfectly with the simulations. For the 
remaining results, we set 7 = 12 dB, unless otherwise stated. 

B. Detection Performance versus Sample Size 

In the second set of experiments, we evaluate the perfor- 
mance of the proposed classification method as a function of 
sample size M. The result is presented in Fig. [2] As observed 
in Fig. [T] also here Cm has the worst classification accuracy, 
e.g. 5% below upper bound at M = 1000. The rcK method 
performs best at small sample sizes, 50 < M < 300. With 
M > 300, the accuracy of rcK and KS is equal. Classification 
based on rcKS method consistently falls slightly below rcK 
and KS methods. In general, rcKS, rcK and KS converge to 
one at the same rate. 
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Fig. 1 . Effect of varying SNR on the probability of classification with J\/=50; 
(an.) - analytical result using jlOj . 
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Fig. 2. Effect of varying sample size on the probability of classification with 
7 = 12 dB. 



C. Detection Performance vs SNR Mismatch and Phase Jitter 

In the third set of experiments we evaluate the performance 
of the proposed classification method as a function of SNR 
mismatch and phase jitter. The result is presented in Fig. [3] 
In case of SNR mismatch. Fig. |3(a)[ our results show the 
same trends as in [3, Fig. 4]; that is, all classification methods 
are relatively immune to SNR mismatch, i.e. the difference 
between actual and maximum SNR mismatch is less than 
10% in the considered range of SNR values. This justifies the 
selection of the limited set of SNR values W for complexity 
evaluation used in Section [Il-D| As expected, ML shows very 
high sensitivity to SNR mismatch. Note again the perfect 



match of analytical result presented in Section II-C with the 
simulations. 

In the case of phase jitter caused by imperfect downconver- 
sion, we present results in Fig. 3(b) for ■y — 15 dB as in Q, in 
contrast to 7 = 12 dB used earlier, for comparison purposes. 
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Fig. 3. (a) Effect of SNR mismatch, nominal/true SNR=12dB; (b) effect of 
phase jitter, nominal SNR=15dB.; (an.) - analytical result using | |1Q) , (mag) 
- magnitude. 



We observe that our method using the magnitude feature, 
rcK/rcKS (mag), as well as the Cm method, are invariant 
to phase jitter. rcK and rcKS perform almost equally well, 
while Cm is worse than the other three methods by sslO%. 
As expected, the ML performs better than all other methods. 
Quadrature-based classifiers, as expected, are highly sensitive 
to phase jitter. Note that in the small phase jitter, < 10°, 
quadrature-based classifiers perform better than others, since 
the sample size is twice as large as in the former case. 

IV. Conclusion 

In this letter we presented a novel, computationally efficient 
method for modulation level classification based on distribu- 
tion distance functions. Specifically, we proposed to use a 
metric based on Kolmogorov-Smirnov and Kuiper distances 
which exploits the distance properties between CDFs corre- 
sponding to different modulation levels. The proposed method 
results in faster MLC than the cumulant-based method, by 
reducing the number of samples needed. It also results in 
lower computational complexity than the KS-GoF method, by 
eliminating the need for a sorting operation and using only a 
limited set of test points, instead of the entire CDF. 
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